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Abstract 

We review the Consistent Amplitude approach to Quantum Theory 
and argue that quantum probabilities are explicitly Bayesian. In this 
approach amplitudes are tools for inference. They codify objective in- 
formation about how complicated experimental setups are put together 
from simpler ones. Thus, probabilities may be partially subjective but 
the amplitudes are not. 



1 Introduction 

Whether Bayesian probabilities - degrees of belief - are (1) subjective or (2) 
objective or (3) somewhere in between, is still a matter of controversy. I vote 
for (3). Probabilities will always retain a "subjective" element because translat- 
ing information into probabilities involves judgments and different people will 
inevitably judge differently. On the other hand, not all probability assignments 
are equally useful and it is plausible that what makes some assignments better 
than others is that they represent or reflect some objective feature of the world. 
One might even say that what makes them better is that they are closer to the 
"truth" . Thus, probabilities can be characterized by both subjective and ob- 
jective elements and, ultimately, it is their objectivity that makes probabilities 
useful. 

Quantum mechanics adds an interesting twist because the recipe for calcu- 
lating quantum probabilities is inflexible: apply Born's rule to the amplitudes. 
There seems to be no room for judgments here. One possibility is that the 
objective nature of the wave function extends also to the quantum probabili- 
ties. In this view quantum and Bayesian probabilities differ in an essential way: 
quantum probabilities are not Bayesian. A second possibility is that it is the 
other way around and it is the subjectivity of the probabilities that infects the 
wave functions. The purpose of this paper is to argue for a third possibility. 

* Invited paper presented at the International Conference on Foundations of Probability 
and Physics — 4 (Vaxjo University, Sweden, 2006). 
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Our goal here is to review the main ideas behind the Consistent Amplitude 
approach to Quantum Theory (CAQT) PPO in order to argue that there is no 
need to distinguish between quantum and classical probabilities. In the CAQT 
approach probabilities are explicitly Bayesian - they reflect the degree to which 
we believe that detectors will successfully detect - and yet, there is nothing 
subjective about the wave function that conveys the relevant information about 
the (idealized) experimental setup. The situation here is quite analogous to 
assigning Bayesian probabilities to the outcomes of a die toss based on the 
objective information that the (idealized) die is a perfectly symmetric cube. 
The probabilities may be partially subjective, but the information on which 
they are based is intended to represent an objective feature of the world. 

Many discussions on the foundations of quantum theory start from the ab- 
stract mathematical formalism of Hilbert spaces and some postulates or rules 
prescribing how the formalism should be used. Their goal is to discover a suit- 
able interpretation. The CAQT is different in that it proceeds in the opposite 
direction: first one specifies the objective the theory is meant to accomplish and 
then a suitable formalism is derived from a set of "reasonable" assumptions. In 
a sense, the interpretation is the starting point and the formalism is derived 
from it. 

The objective of the CAQT is to predict the outcomes of certain idealized 
experiments on the basis of information about how complicated experimental 
setups are put together from simpler ones. The theory is, by design, a theory of 
inference from available information. The "reasonable" assumptions are four. 
The first specifies the kind of setups about which we want to make predictions. 
The second assumption establishes what is the relevant information on the basis 
of which the predictions will be made and how this information is codified. It 
is at this stage that amplitudes and wave functions are introduced as tools for 
the consistent manipulation of information. The third and fourth assumptions 
provide the link between the amplitudes and the actual prediction of experi- 
mental outcomes. Although none of these assumptions refer to probabilities, 
all the elements of quantum theory, including indeterminism and the Born rule, 
Hilbert spaces, linear and unitary time evolution, are derived. 

2 Setups and amplitudes 

The first and most crucial step is a decision about the subject matter. We 
choose a pragmatic, operational approach: the objective of quantum theory is 
to predict the outcomes of experiments carried out with certain idealized setups. 

Next we note that if setups are related in some way - perhaps two setups 
can be connected to build a third one - then information about one may be 
relevant to predictions about the other. This suggests that one should identify 
the possible relations among setups. To avoid irrelevant technical distractions 
we consider a very simple quantum system, a "particle" that lives on a dis- 
crete lattice and has no spin or other internal structure. (In fact, whether the 
system is a particle or a wave or both or neither is immaterial; the word 'par- 
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Figure 1 : (a) The simplest setup [x f,Xi\. (b) Two successive setups [a; / , x'] 
and [ combined with the and operation, (c) Two setups in parallel, 

] and [xf, x" , Xi] are combined with the OR operation. 



ticle' is selected only because a word, some word, must be used to refer to the 
system.) The generalization to more complex configuration spaces should be 
straightforward. 

The simplest experimental setup, denoted by [x/,Xj], is shown in Fig. 1(a). 
It consists of a source at the space-time point Xj — (xi,ti) and a detector at 
Xf = (xf,tf). Both Xi and Xf are assumed to be points on a discrete lattice 
and the intermediate region may be filled by various kinds of fields. In Fig. 1(b) 
we show a slightly more complex setup that we denote by [xf, x', x»] where the 
source is at Xi , the detector at xj , and we have introduced a "filter" that blocks 
all paths from xi to Xf except via the intermediate point x' . The possibility 
of introducing many filters each with several holes (as in Fig. lc) leads to 
allowed setups of the general form a — [xf, s n , s„_i, . . . ,s 2 , Si,Xi] where each 
s n = (x n ,x' n ,x", . . .) represents a filter at time t n , intermediate between ti and 
tf, with multiple holes at x n , x' n , x^, . . . 

There are two basic kinds of relations among setups. The first, called and, 
arises when two setups a and b are placed in immediate succession resulting in a 
third setup which we denote by ab. It is necessary that the destination point of 
the earlier setup coincide with the source point of the later one, otherwise the 
combined ab is not allowed. For example, [xf, x'][x' , Xi] — [xf,x',Xi] is shown 
in Fig. 1(b). With the single setup [xf,x',Xi] we can perform several different 
experiments. We can place a source at Xi and the detector at x' in which case we 
are only using the [x 1 , Xi] component; or we can place the source at x' and the 
detector at Xf and use only the [x/,x'] component; or we can place a source at 
Xi, the detector at x/, and use the whole thing [x/,x', Xj]. Although we can only 
do one of these experiments at a time, the three experiments are clearly related 
and knowing something about one setup may be helpful in making predictions 
about another. 

The second relation, called OR, arises from the possibility of opening addi- 
tional holes in any given filter. Specifically, when (and only when) two setups 
a' and a" are identical except on one single filter where none of the holes of a' 
overlap any of the holes of a", then we may form a third setup a, denoted by 



a' V a" , which includes the holes of both a' and a". The example 

[Xf, (x',x"),Xi] = [Xf,x',Xi] V [Xf,x",Xi] (1) 

= {[x f ,x')[x',x i ))W{{x f ,x"}\x",x i ]) (2) 

is shown in Fig. 1(c). Again we see that we can perform a wide variety of related 
experiments with the same setup by judicious placement of source and detector 
and by blocking or not the appropriate holes. 

Provided the relevant setups are allowed the basic properties of AND and 
OR are quite obvious: OR is commutative, but and is not; both AND and OR 
are associative, and finally, AND distributes over OR but not vice-versa. One 
should emphasize that these are physical rather than logical connectives. They 
represent our idealized ability to construct more complex setups out of simpler 
ones and they differ substantially from their Boolean and quantum logic counter- 
parts. In Boolean logic not only and distributes over OR but OR also distributes 
over and, while in quantum logic propositions refer to quantum properties at 
one time rather than to processes in time. 

Thus, our first assumption is 

Al. The goal of quantum theory is to predict the outcomes of experiments 
involving setups built from components connected through AND and OR. 

Having identified relations among setups we now seek a way to handle them 
quantitatively. A representation of and/or can be obtained by assigning a 
complex number (f>(a) to each setup a in such a way that relations among setups 
translate into relations among the corresponding complex numbers. What gives 
the theory its robustness, its uniqueness, is the requirement that the assignment 
be consistent: if there are two different ways to compute 4>(a) the two answers 
must agree. The remarkable consequence of requiring consistency is embodied 
in the following 

Regraduation Theorem: Given one consistent representation of and/or in 
terms of complex numbers </>(a), one can "regraduate" with an appropriately 
chosen function ip to obtain an equivalent and more convenient assignment, 
ip(a) = ip(<fi(a)), so that and and OR are respectively represented by multipli- 
cation and addition, 

i/j (ab) = ip (a) ip (b) and ^(aVa')=^ (a) + ip (a') . (3) 

Complex numbers assigned in this way will be called amplitudes. The theorem 
is proved in ^ [2J. (For an independent earlier derivation see ref. [S]-) 

It would be interesting to explore the consequences of keeping track of more 
information about the setups by using representations of and/or in terms of 
more complicated mathematical objects. It is likely that the resulting theory 
would differ from quantum mechanics in important ways. For our current pur- 
poses however we restrict ourselves to a representation in terms of amplitudes. 
This is our second assumption, 

A2. The relevant information for predicting the outcome of an experiment with 
a setup a is codified into its complex amplitude ip(a). 
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3 Wave functions and time evolution 



The observation, already expressed by Feynman [jy, that leads to the funda- 
mental evolution equation (and eventually to path integrals) is that a filter that 
is totally covered with holes is equivalent to having no filter at all, 

[xf,Xi]= Y ([xf,x t }[xt,x l ]) . (4) 

all x at t 

In terms of the corresponding amplitudes, this is quantitatively expressed as 

ip(xf,Xi)= ^ 4>(xf,Xt)ip(x t ,Xi). (5) 

all x at t 

Since we are interested only in the response of the detectors we find that 
in there are situations where amplitudes keep track of too much information. 
Indeed, note that there are many possible combinations of starting points Xi 
and of interactions prior to the time t that will result in identical evolution after 
time t. What these different possibilities have in common is that they all lead to 
the same numerical value for the amplitude ip(x t , Xi). It is therefore convenient 
to set ^(x, t) = tp(xt,Xi) and omit all reference to the irrelevant starting point 
X{. The object thus introduced, ^(x, i), is called a wave function. The usual 
language is that ^>(x,t) describes the state of the particle at time t. In the 
CAQT approach we say that \&(x, t) encodes information about those features 
of the setup prior to t that are relevant to time evolution after t. 

The evolution equation can then be written as 

= J2 ^{x f ,tf,x,t)^(x,t), (6) 

all x at t 

which is equivalent to a linear Schrodinger equation as can easily be seen pP [3] 
by differentiating with respect to i/ and evaluating at tf = t. Thus, a quantum 
theory formulated in terms of consistently assigned amplitudes must be linear: 
nonlinear modifications of quantum mechanics must either violate assumptions 
Al or A2 or else be internally inconsistent. 

4 Hilbert space 

In the usual approach to quantum theory the linearity embodied in the principle 
of superposition and the linearity of the Schrodinger equation seem completely 
unrelated. Indeed, all proposals for a non-linear quantum mechanics have invari- 
ably maintained the former and broken the latter. Within the CAQT approach 
this is not allowed. One cannot preserve the linearity of superpositions and 
break the linearity of evolution because both follow from the same consistency 
requirement that led to the sum and product rules for amplitudes. 

The superposition principle states that if a system can be prepared in a state 
'J/' and it can be prepared in another state VP" then it can also be prepared in 
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Figure 2: Illustrating how the superposition principle follows from the sum and 
product rules for consistent amplitudes. 



a linear superposition W = a^>' + 0^". An interesting question is whether the 
superposition can actually be prepared. Given a generic pair of states and 
iff", what is the specific setup that will prepare the state a^f' + j3W7 

Within the CAQT approach this annoying question does not arise because 
we are not interested in preparing arbitrary states, but only in predicting the 
outcomes of experiments with those special setups described in assumption Al. 
For these special setups it is easy to see that linear superpositions do arise. 
For example, let ^'(x,t) = ip(x,t; x' , t ) be the wave function at time t when 
the source is at (a?, to), and let ty"(x,t) = ip(x, t; x", to) be the wave function 
when the source is at (x",to). See Fig. 2. Linear superpositions of ^'(x^t) and 
fy"(x,t) can be prepared by placing a filter at t with holes at x' and x" and 
a source at a point (x^tj) prior to t . Then the overall amplitude for this new 
setup, ip(x,t;Xi,ti), is 



where a = if>(x', to; Xi, U) and j3 = ip(x" , to; Xi,U). Notice that the complex 
numbers a and /3 can be changed at will by moving the starting point (xi,ti) 
or by introducing additional fields between t% and to- 

Thus, as a consequence of the sum and product rules, eq.(|3j|, linear super- 
positions of wave functions arise naturally and we can ask what else is needed 
for them to form a Hilbert space. The basic question is whether an equally 
compelling inner product is available. The answer is quite illuminating because 
of the support it gives to viewing quantum mechanics as a theory of inference 
rather than as a law of nature. 

An inner product defines lengths and angles. We find that without any 
additional assumptions the basic components of setups, the filters, take us a long 
way toward an inner product because they already supply us with a concept of 
orthogonality. The action of a filter P with holes at a set of points x p is to turn 
the wave function ^>(x) into the wave function P^>(x) = J2 P ^x,s p ^(x). Since 
the filters P act as projectors, P 2 = P, it follows that any given filter defines 
two special classes of wave functions. One is the subspace of wave functions 
that are transmitted by the filter without any changes, and the other is the 
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ofl>'(x,t) +/3^"(x,t) , 



(7) 
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subspace of wave functions that are completely blocked. We will define these 
two subspaces as being orthogonal to each other. 

Any wave function can be decomposed into orthogonal components, ^ = 
*p + *i_p where * P = p-fy is unaffected by the filter and = (1 - P)$ is 

blocked: P^p = ^p and P^i-p = 0. A particularly convenient expansion in 
orthogonal components is that defined by a complete set of elementary filters. 
A filter Pi is elementary if it has a single hole at x*j. It acts by multiplying *(x) 
by <%£;• The set is complete if ^ P, = 1. Then \&(x) = X^^'i^?A> wnere 

At this point it is convenient to introduce the familiar Dirac notation. In- 
stead of \I/(x) and we shall write |$) and so that |*} = Xli^l*)- The 
question is what else, in addition to orthogonality, do we need to define the inner 
product. Recall that an inner product satisfies three conditions: (a) (vf'lvE') ^ 
with (^l*) = if and only if I*) = 0, (b) linearity in the second factor, and (c) 
antilincarity in the first factor, ( < I > | V I') = (\&|<I>)*. Conditions (b) and (c) define 
the product of 1$} = Bj\j) with \^>) = J^i in terms of the product of 
\j) with 

($m=J2 B jMM ■ (8) 

i 

The orthogonality of the basis functions 5s t s i is naturally encoded into the inner 
product by setting = for i j, but the case i = j remains undetermined, 
constrained by condition (a) to be real and positive. Clearly an additional 
assumption is necessary. Here it is: 

A3. If there is no reason to prefer one region of configuration space over another 
they should be assigned equal a priori weight. 

We therefore choose equal to a constant which, without losing generality, 
we set equal to one: 

(i\j)=6 ij ^($\*) = J2B*A i , (9) 

i 

and this leads to the Hilbert norm ||*||" = (*|*) = ]T\ \A l \ 2 . The significance 
of A3 will be better understood once we explore its implications in the next 
section. 

5 Probabilities: the Born rule 

Finally, we get to the question of using the information encoded into amplitudes 
to predict the outcomes of experiments. Our argument invokes time evolution, 
eq. ©, in an essential way. This is in contrast to other derivations of the Born 
rule (such as Gleason's) which are essentially static. 

It is perhaps best to start with a simple special case. Suppose that the 
preparation procedure is such that ^/(x, t) vanishes at a certain point (x*o,i). 
Then, according to eq. 10, placing an obstacle at the single point (x*o,t) (i.e., 
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placing a filter at t with holes everywhere except at xq) has no effect on the 
subsequent evolution of ty. Since the mathematical relations among amplitudes 
are meant to reflect the corresponding physical relations among setups, it seems 
natural to assume that the presence or absence of the obstacle at Xq will have 
no observable effects. What if the obstacle was itself a detector? It is just 
as natural to assume that since the obstacle/detector inflicts no action on the 
system, then it must not itself suffer any reaction. Therefore, we conclude that 
when ^(xo,t) = the particle will not be detected at (xo,t). 

Convincing and natural as this argument might be, it is important to rec- 
ognize that an assumption was made: no amount of logic could conceivably 
bridge the gap from a mathematical representation in terms of amplitudes to 
the prediction of an actual physical event. The assumption can be generalized 
to the following general interpretative rule: 

A4. If the introduction at time t of a filter blocking those components of the 
wave function characterized by a certain property V has no effect on the 
future evolution of a particular wave function then when the wave 
function happens to be \& (t) the property V will not be detected. 

Note that A4 deals with a situation of complete certainty; no probabilities are 
mentioned. 

The deduction of the Born rule now proceeds as in ref. [2] • Briefly the idea 
is as follows. We want to predict the outcome of an experiment in which a 
detector is placed at a certain Xk when the system is in state |$) = X^^il*)- 
In [2] we showed that the state for an ensemble of N identically prepared, in- 
dependent replicas of our particle is the product \^n) — Yl a= i \^a)- Now 
we apply the interpretative rule A4. Suppose that in the TV-particle configu- 
ration space we place a special filter, denoted by Pf e , the action of which is 
to block all components of |^jv) except those for which the fraction n/N of 
replicas at Xk lies in the range from / — e to f + e. The difference between 
the states P^J^n) and \^n) is measured by the relative Hilbert distance, 
||P^|*jv) ~ I 1 ** n)\\ 2 / n\^ n) ■ The result of this calculation is 

lim \\P! S \* N ) - \^n)\\ 2 = 1- [ t+e S(f-\A k \ 2 )df , (10) 

N~>oo J ' J f _ E 

where we have normalized (^l^) = (Wjvl'S'./v) = 1. 

We see that for large TV the filter Pi s has no effect on the state \^n) provided 
/ lies in a range 2e about |^4fc| 2 . Therefore, according to A4, the state |^jv) does 
not contain any fractions outside this range. On choosing stricter filters with 
e — ► we conclude that we can predict with complete certainty that detection 
at Xk will occur for a fraction \Ak\ 2 of the replicas and that detection will not 
occur for the remaining fraction 1 — |^4fc| 2 . For any one of the identical individual 
replicas there is no such certainty; the best one can do is to say that detection 
will occur with a certain probability Pr(/c). In order to be consistent with the 
law of large numbers the assigned value must agree with the Born rule 

Pr(fc) = |^ fe | 2 . (11) 
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The important role played by the inner product, introduced through assump- 
tion A3, into ea. i(TT)|) must be emphasized. In fact, had we weighted the |i)'s 
differently and chosen a different normalization, say — Wi, the resulting 
probability would have been Pr(i) = w.^ | | 2 instead of ea. l(TT|) . 

It is instructive to explore this issue further. Nothing in our choice of a 
discrete lattice requires it to be uniform and periodic. Indeed, a non-uniform 
discrete lattice could have been defined in terms of an arbitrary frame of curvi- 
linear coordinates. If the underlying configuration space is Euclidean the use 
of curvilinear rather than cartesian coordinates is a matter of choice but in the 
general case of curved spaces curvilinear coordinates are unavoidable. 

Assumption A3 suggests that each cell i in the non-uniform lattice be 

1/2 

weighted by its own volume which we will denote by io, = g i Ax. (The 
usual notation for a volume element in arbitrary coordinates is g 1 ' 2 (x)dx where 
g(x) is the determinant of the metric tensor, and dx = dx 1 . . .dx 3 in three di- 
mensions.) In the continuum limit [3] we let Ax — > dx. Replacing w~[~ \i) = 

2 Aac) _1 |i) by\x) the completeness condition 1 = J^Pi = X^ w 4 rl |*)(*l be- 
comes 1 = f g 1 / 2 dx\x)(x\. Next, replace Sij/Ax by 5(x — x') and the inner 
product — WiSij becomes (x\x'} — g _1 / 2 <5(a; — x'). Finally, replace Ai by 
A(x) and the state |*) = X^^N) becomes |\&) = / g 1/2 dx A(x) \x). The Born 
rule, eq. i(TT|) . becomes 

Vr{dx) = g 1/2 dx\A(x)\ 2 (12) 

which shows that |A(x)| 2 is the probability density. 

It is sometimes argued that while there is an element of subjectivity in the 
nature of classical probabilities but that quantum probabilities are different, 
that they are totally objective because they are given by \A\ 2 . We have just 
shown that this assignment is neither more nor less subjective than say, assigning 
probabilities to each face of a die. Just like we assign probability 1/6 when we 
have no reason to prefer one face of the die over another, the Born rule follows, 
even in curved spaces, when we have no reason to prefer one volume element 
over another — provided their volumes are equal. 

6 Conclusion 

Probabilities may be subjective but the information on the basis of which we 
update probabilities is not supposed to be. Indeed, we want to update from a 
prior to a posterior distribution because we believe the posterior distribution will 
give us a better assessment of truth. Posterior distributions are "objectively" 
better than prior distributions. 

We have argued that amplitudes and wave functions codify objective infor- 
mation about the way experimental setups are put together. Quantum proba- 
bilities take this information into account and thereby incorporate an objective 
element - this is why they work. On the other hand, although amplitudes and 
wave functions may be representations of objective information, an element of 
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subjectivity is introduced into the corresponding state vectors and the Hilbert 
space through the specification of the inner product. It was necessary to make 
an assumption - a judgement - about the a priori weight assigned to any region 
of configuration space. 

It is possible to develop the CAQT approach further and use an entropic 
argument [3] to show that time evolution is not merely linear but that it is 
also unitary. This allows us to introduce observables other than position. It is 
also possible to consider more complex many-particle sources and detectors and 
thus generalize to many-particle systems. But these developments lie beyond the 
limited goal of this paper which was to review the argument from the objective 
information embodied in consistent amplitudes to the Bayesian probabilities of 
quantum theory. 
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